Quality Assurance for Document Image Collections in Digital Preservation
نویسندگان
چکیده
Maintenance of digital image libraries requires to frequently asses the quality of the images to engage preservation measures if necessary. We present an approach to image based quality assurance for digital image collections based on local descriptor matching. We use spatially distinctive local keypoints of contrast enhanced images and robust symmetric descriptor matching to calculate affine transformations for image registration. Structural similarity of aligned images is used for quality assessment. The results show, that our approach can efficiently asses the quality of digitized documents including images of blank paper.
منابع مشابه
A Fuzzy Logic Based Expert System for Quality Assurance of Document Image Collections
Huge document image collections in digital libraries are prone to reduced quality and require automatic quality assurance. This paper presents an approach for bringing together information automatically aggregated from a quality assurance tool and expert knowledge related to digital preservation. The main contribution of this work is the definition of fuzzy expert rules and the application of f...
متن کاملDuplicate Detection for Quality Assurance of Document Image Collections
Digital preservation workflows for image collections involving automatic and semi-automatic image acquisition and processing are prone to reduced quality. We present a method for quality assurance of scanned content based on computer vision. A visual dictionary derived from local image descriptors enables efficient perceptual image fingerprinting in order to compare scanned book pages and detec...
متن کاملAn Expert System for Quality Assurance of Document Image Collections
Digital preservation workflows for automatic acquisition of image collections are susceptible to errors and require quality assurance. This paper presents an expert system that supports decision making for page duplicate detection in document image collections. Our goal is to create a reliable inference engine and a solid knowledge base from the output of an image processing tool that detects d...
متن کاملPeople Mashing: Agile Digital Preservation and the AQuA Project
Manual quality assurance (QA) of digitised content is typically fallible and can result in collections that are marred by a variety of quality and access issues. Poor storage conditions, technology obsolescence and other unforeseen problems can also leave digital objects in an unusable state. Detecting, identifying and ultimately fixing these issues typically requires costly and time consuming ...
متن کاملAutomated Preservation: The Case of Digital Raw Photographs
In digital preservation, a common approach for preservation actions is the migration to standardized formats. Full validation of the results of such conversion processes is required to ensure authenticity and trust. This process of quality assurance is a key obstacle to achieving scalability for large volumes of content. In this article, we address the quality assurance process for the preserva...
متن کامل